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INTRODUCTION 

One major function of the carbohydrates in glycoproteins and glycolipids is to 
present information in the form of three-dimensional structures. The information in 
oligosaccharide structures is decoded through the binding of carbohydrate recognizing 
proteins called lectins. Involvement of lectins in diverse biological phenomena, from 
intracellular routing of glycoproteins to cell-cell adhesion and phagocytosis, has been 
reported (1). Especially significant is the progress made in elucidating the role of 
selectins, a subgroup of C-type lectins, in inflammation (2-4). Under physiological flow 
conditions, leukocytes interact with activated vascular endothelium through leukocyte 
rolling, followed by firm adhesion, spreading, and extravasation. Selectins, which bind to 
oligosaccharide structures called sialyl Le x and sialyl Le a , were shown to participate in 
the rolling process. Since both leukocytes and tumor cells, in particular carcinomas and 
leukemias, express these oligosaccharides on their cell surface, it was postulated that 
tumor cells might use selectin-carbohydrate interaction during metastasis. Although 
limited, existing experimental data support this hypothesis. Carbohydrate-protein 
interactions are not restricted to selectins. Other lectins may be used by tumor cells 
when interacting with carbohydrates, and these lectins may also play an important role 
in progression and metastasis. A significant difference was observed in the degree of 
glycosylation of the mucin, as well as in the N-glycosylation pattern of glycoproteins 
between normal and cancer breast cells (5-8). Therefore, to explore that possibility 
breast cancer was chosen as the prime target of our study. 

The primary goal of this proposal is to identify lectins that are either produced by 
breast cancer cells or that are capable of binding to cell surface carbohydrate structures 
of the breast cancer cells. In order to identify these lectins we chose to employ a 
powerful technique known as phage display (9-12). Use of the recently introduced T7 
phage system allowed us to express proteins up to 1200 amino acids, which might be 
sufficient to contain the carbohydrate-recognition domain (CRD) of lectins. 

BODY 

Task 0. To prove the feasibility of phage display strategy in cloning cDNA encoding 
proteins with carbohydrate-binding property 

From our preliminary experiments with a HeLa cell phage display cDNA library 
and an isolated blood group H-specific glycoprotein fraction, we obtained results 
suggesting the possible utility of the phage display strategy in identifying carbohydrate 
binding proteins in vitro. By using this glycoprotein fraction as bait, amplification from 
the library of several phage clones expressing sequences (13-16) from galectin-3 was 
observed. Galectin-3 is a member of S-type lectins (17-19), and an affinity with the 
blood-group substance has been reported (19, 20). Before expanding our research to 
breast cancer, we proved that amplification of galectin-3 phage display clones by in vitro 
biopanning was not incidental but rather selective. Southern hybridization as well as 
plaque hybridization was used to prove the selectivity. Results from this portion of work 
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have been published (Yamamoto, M., Kominato, Y., and Yamamoto, F. (1999). Phage 
display cDNA cloning of protein with carbohydrate affinity. Biochem. Biophys. Res. 
Commu. 255: 194-199), and the reprint is located in the Appendix. 


Task 1. To obtain phage display cDNA clones which specifically bind to isolated 
glycoproteins and cell surfaces 

* Construction of phage display cDNA libraries with more than one million independent 
clones 

We have constructed a total of 11 phage display cDNA libraries in T7Select 1-1 vector, 
where eight of these libraries contained more than one million independent clones. The 
size and titers of the amplified libraries were determined and listed in Table 1. 

* In vitro biopanning to isolate clones from phage display cDNA libraries 

In addition to the blood group glycoprotein fraction from gastric mucosa, we used 
commercially available purified glycoprotein and glycoprotein fractions for in vitro 
biopanning experiments. 

The list of glycolipids and glycoproteins used for in vitro biopanning are as follows. 
Glycoproteins: 

Blood group H-expressing mucin fraction from gastric mucosa, otl-Acid Glycoprotein 
from human serum, Fibrinogen from human plasma, extracellular matrix glycoprotein 
fraction of zonae pellucida from porcine ovary, and sialyl Lewis x-BSA 

Glycolipids: 

Trihexosylceramide (CTH), Asialoganglioside-GMI, Asialoganglioside-GM2, 
Monosialoganglioside-GMI, Monosialoganglioside-GM2, and Monosialoganglioside- 
GM3 

MaxiSorp 96-well plates were coated with glycoproteins and used for binding selections. 
Some glycolipids were coated onto 96-well PolySorp plates while other charged 
glycolipids were spotted onto a polyvinylidene difluoride (PVDF) membrane (21). 

* In vivo biopanning to isolate clones from phage display cDNA libraries 

In vivo biopanning experiments were performed using EJG bovine capillary endothelial 
cells and MCF-7 and BT-20 human mammary carcinoma cells as baits. 

* In situ biopanning to isolate clones from phage display cDNA libraries 

Although freshly frozen breast cancer tissue sections, together with normal breast tissue 
sections from the same patients, were obtained from the University of Michigan Breast 
Cancer Tissue Bank, we have not yet started in situ biopanning experiments. 

Task 2. To characterize the selected phage populations as well as isolated individual 
clones 
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‘Monitoring the enrichment of certain phage clones by PCR using lysates from selected 
population of phages, determining nucleotide sequences cDNA inserts, and identifying 
the clones of interest 

In vitro biopanning 

After the fifth or fourth round of in vitro biopanning, PCR amplification of cDNA inserts 
was performed and the PCR products were analyzed by agarose gel electrophoresis. 
Several discrete bands were observed from each selection, and the selections were 
repeated using this variety of glycoproteins and glycolipids as baits. These bands were 
excised from the gel, DNA was gel-purified, and the nucleotide sequences of the cDNA 
inserts were determined directly or after cloning into the plasmid vector. So far, 
nucleotide sequences of 89 bands were determined (Table 2). Homology search was 
performed using the BLAST program. Unexpectedly however, none of the phages 
contained the cDNA sequences encoding the same proteins except that some clones 
encoding galectin-3 were obtained when blood group glycoprotein was used as bait. 
Therefore, specific binding has not yet been proven for any candidate clones. 

In vivo biopanning 

Different from the above-mentioned in vitro biopanning experiments, no discrete bands 
were observed after the sixth round of selection. Apparently many phage clones bound 
to the cell surface (and/or cell culture plate substratum). We next examined whether 
phages could be further screened based on stimulatory/inhibitory activity on cell growth. 
For this purpose, 6x48 clones (may not be completely independent) were randomly 
selected from the fifth/sixth rounds of EJG cell selections and infected to the host 
bacteria. Phage lysates were then added onto the EJG cell culture in 24-well dishes in 
duplicate. After 72 hours of incubation, cell growth was monitored by MTT assay (22). 
Three clones each showing the highest or the lowest values in duplicate MTT 
experiments were selected out of the respective set of 48 clones. Therefore, a total of 
36 clones were further analyzed in the second round of MTT assays. We anticipated 
that those phage clones would exhibit the same tendency as observed in the first round 
forming two separate populations (one with stimulatory activity and the other with 
inhibitory activity on EJG cell growth). Disappointing results were obtained. Some of the 
clones previously identified to enhance the cell growth in the first round of selection 
turned out to repress the growth in the second round, and vice versa. We went on the 
next screening hoping to identify several true candidates out of incidentally selected 
clones, but the third selection was not successful either. Again, no separation between 
growth stimulatory clones and growth inhibitory clones was observed. Results were 
random and not reproducible. Considering that the phage titers were approximately 
5x10 7 pfu/fxl and 500 ^1 of lysate was added to a 2ml of cell culture, the concentrations 
of the fusion proteins (5x10 7 x 500 molecules/2.5ml = 10 13 molecules/l = 10 13 /6.02x10 23 
mol/I = 17 xIO' 12 mol/l = 17pM) may have not been sufficient to induce growth changes. 
In order to achieve higher concentration of fusion proteins, transfer of the cDNA inserts 
from phage display vector to a eukaryotic expression vector may be necessary. We 
sequenced 72 phage clones from the population selected with EJG cells as bait, and 
they are listed in Table 2. 
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Task 3. To characterize the identified genes and their gene expression 
*Performing Northern hybridization and/or RT-PCR to determine the mRNA level using 
RNA from normal and tumor breast cell lines, and performing Southern hybridization 
using DNA from normal and tumor breast cell lines 

We have not yet identified candidate clones interesting enough to pursue further. Once 
those candidates are identified, we will characterize those genes in detail. 

KEY RESEARCH ACCOMPLISHMENTS 

• Demonstration of the feasibility of the phage display technique to clone cDNA of 
carbohydrate-binding protein 

• Isolation and identification of phage display clones which bind to the purified 
glycoproteins and glycolipids 

REPORTABLE OUTCOMES 

• A manuscript was submitted to, accepted by, and published in Biochem. Biophys. 
Res. Commu. reporting the feasibility of phage display approach to clone cDNA 
encoding carbohydrate-binding protein. 


CONCLUSIONS 

Although we could prove the feasibility of the phage display technique to clone 
carbohydrate-binding proteins, we have not succeeded in identifying previously 
unrecognized lectin molecules. In vitro biopanning produced many candidates, 
however, none of these clones came from the cDNA encoding the same proteins except 
for galectin-3. Therefore, it will be necessary to narrow down the number of candidates. 
Results from in vivo biopannning suggested the necessity of functional analysis. 
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APPENDIX 


Table 1. List of phage display cDNA libraries 


MCF-7 Breast adenocarcinoma 

MCF-7 Breast adenocarcinoma 

PC-3 prostate adenocarcinoma 

PC-3 prostate adenocarcinoma 

HeLa cell transfectant 

HeLa cell transfectant 

HeLa cell transfectant 

EJG bovine capillary endothelial cell 

Human liver 

Human liver 

Human lymph node 


1.06x10 5 clones 
1.26x10 7 clones 
1.94x10 5 clones 
1.63x10 7 clones 
1.77x10 s clones 
3.05x10 s clones 
5.29x10 s clones 
1.54x10 7 clones 
4.68x10 5 clones 
2.55x10 7 clones 
4.63x10 s clones 


4x10 7 pfu/pl 
7.23x10 7 pfu/pJ 
4.4x10 7 pfu/|j] 
2.80x10' pfu/j_U 
3.9x10 7 pfu/pJ 
3.25x10 t pfu/j_U 
3.44x10 7 pfu/pJ 
2.80x10 7 pfu/fjJ 
5x10 7 pfu/pl 
3.48x10 7 pfu/pl 
5.27x10 7 pfu/pl 


The number of the independent clones and the titers of the amplified libraries are shown. 


Table 2. Sequence homology of the phage clones obtained by selections 

(I). In vitro selection 

A. Glycoproteins 

1. Blood group H-expressing mucin fraction 


M 

X89718 

1.9e-16 

Putative 26S protease subunit mRNA 

M 

AA134606 

l.le-88 

EST 

M 

X98614 

2e-92 

cytokeratin 8 mRNA 

M 

AC002558 

e-114 

chr. 17 identical 

M 

AA627181 

e-108 

EST 

M 

X06617 

e-128 

ribosomal protein SI 1 mRNA 

M 

AI394640 

e-153 

EST 

M 

J05176 

e-113 

alpha 1-antichymotrypsin mRNA 

M 

X90826 

e-170 

USF2 mRNA 

M 

AA040063 

e-162 

EST 

M 

D38112 

6e-32 

mitochondrial DNA 

MPELiLy 

Y13710 

0.0 

alternative activated macrophage-specific CC cytokine mRNA 

MPELiLy 

T89015 

2e-19 

KIAA 0220 mRNA 

MPELiLy 

AA225548 

3e-30 

EST/Alu 

MPELiLy 

S416941 

6e-4 

PTMAP2 mRNA 

MPELiLy 

L41066 

7e-53 

NF-AT3 mRNA 

MPELiLy 

U90426 

0.0 

nuclear RNA helicase mRNA 

MPELiLy 

AA83124 

3e-15 

Alu 

MPELiLy 

AI315224 

2.2 


MPELiLy 

AA832124 

3e-15 

Alu 

MPELiLy 

Z54072 

e-17 

Alu 

MPELiLy 

AC004383 

0.24 


MPELiLy 

M36682 

e-36 

laminin-binding protein (galectin-3) mRNA 

MPELiLy 

AI253442 

6e-50 

cytochrome C mRNA 

MPELiLy 

AC002487 

4.9 


MPELiLy 

AC004896 

3e-12 

Alu 

MPELiLy 

H75348 

2e-25 


MPELiLy 

AA71349 

4e-l19 

EST 

MPELiLy 

AC004911 

2.1 


MPELiLy 

U63090 

3e-67 

alpha 2,3 sialyltransferase mRNA 

MPELiLy 

W33034 

2e-99 

EST 

MPELiLy 

R76396 

4e-23 

Alu 

MPELiLy 

AA804821 

e-60 

ATP synthetase mRNA 

MPELiLy 

AC005224 

3e-46 

Alu 

MPELiLy 

AA384311 

e-13 

EST 

MPELiLy 

AF042506 

4e-91 

cytochrome b mRNA 

MPELiLy 

AA327241 

0.008 


MPELiLy 

Y00097 

3e-36 

p68 protein mRNA 

MPELiLy 

AC00310 

6e-38 

Alu 

MPELiLy 

X75362 

0.014 


MPELiLy 

AA687598 

6e-54 

EST 


T JJ 
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MPELiLy 

S80305 

7e-29 

beta2 glycoprotein I mRNA 

MPELiLy 

U24576 

7e-84 

breast tumor autoantigen mRNA 

MPELiLy 

X02308 

e-136 

thymidylate synthetase mRNA 

MPELiLy 

U35451 

0.0 

heterochromatin protein p25 mRNA 

MPELiLy 

X93334 

0.0 

mitochondrial DNA 

MPELiLy 

AC003034 

e-120 

BAC clone ' 

MPELiLy 

Z84480 

0.002 


2. Sialyl Le x -BSA 



MPELiLy 

AF069378.1 

3e-82 

Insulin-like growth factor receptor II 

MPELiLy 

Z97353.3 

3e-55 

ribosomal protein L10 mRNA 

MPELiLy 

L06237 

5e-9 

microtubule-associated protein IB mRNA 

MPELiLy 

NM 001306.1 

4e-60 

Claudin 3 mRNA 

3. Zonae pellucida glycoprotein fraction 

MPELiLy 

U93305 

6e-10 

A4 differentiation dependent triple LIM domain protein mRNA 

MPELiLy 

NM 000970.1 

e-112 

ribosomal protein L27a mRNA 

4. Fibrinogen 




MPELiLy 

U63290 

0.54 


5. alpha 1 Acid glycoprotein fraction 


MPELiLy 

AA996393 

5e-7 EST 


B. Glycolipids 



l.CTH 




MPELiLy 

AJ000039 

3e-93 

G3PDH 

MPELiLy 

AF004338 

e-147 

16S rRNA 

MPELiLy 

VI1395 

0.0 

p40 protein mRNA 

MPELiLy 

D38524 

e-168 

5’ nucleotidase mRNA 

MPELiLy 

AJ130972 

0.0 

U2 sn RNP-specifrc A protein mRNA 

2. Asialo GM2 



MPELiLy 

AJ130972 

e-96 

U2 sn RNP mRNA 

MPELiLy 

AB012622 

3e-97 

Tapasin mRNA 

MPELiLy 

AF055004 

0.0 

KIAA0763 gene mRNA 

MPELiLy 

X04225 

e-112 

HCC alpha 1 microglobulin mRNA 

MPELiLy 

Z84480 

0.002 


3. Asialo GM1 



MPELiLy 

X86343 

e-157 

complement C7 mRNA 

MPELiLy 

ABO 12622 

3e-97 

Tapasim mRNA 

MPELiLy 

D88984 

e-132 

Ampd 3 gene 

MPELiLy 

S74678 

e-123 

heterogeneous nuclear RNP complex K mRNA 

MPELiLy 

M29064 

5e-06 

hnRNP B1 protein mRNA 

MPELiLy 

Z75894 

0.14 


4. GM3 




MPELiLy 

NM 003191.1 

4e-59 

thr-tRNA synthetase mRNA 

MPELiLy 

R78124 

1,7 


MPELiLy 

X85133 

3e-49 

p53 associated RBQ-1 mRNA 

MPELiLy 

AJ000039 

7e-40 

G3PDH mRNA 

5.GM2 




MPELiLy 

U96922 

3e-93 

inositol polyphosphate 4-phosphatase type II mRNA 

MPELiLy 

U68758 

0.0 

pyroline-5-carboxylate synthetase 

MPELiLy 

AF035429 

2e-97 

cytochrome oxidase subunit I mRNA 

MPELiLy 

AI093867 

e-144 

Hox A10 mRNA 

MPELiLy 

Y11395 

0.0 

p40 protein mRNA 

MPELiLy 

U011184 

e-112 

flightless gene product mRNA 

MPELiLy 

X56932 

2e-66 

23kD highly basic protein mRNA 

MPELiLy 

AC005303 

0.11 


MPELiLy 

Ml 1233 

e-112 

cathepsin O mRNA 

6. GM1 




MPELiLy 

X85786 

0.0 

DNA binding regulatory factor mRNA 

MPELiLy 

L05095 

0.0 

ribosomal protein L30 mRNA 

MPELiLy 

AA977240 

e-176 

EST 

MPELiLy 

D42039 

0.0 

KIAA 0081 gene mRNA 

(II). In vivo biopanning 



A. EJG cells 




MPH 

U14970 

3e-l 13 

ribosomal protein S5 mRNA 

MPH 

X73974 

4e-140 

ribosomal protein L4 mRNA 

MPH 

X01630 

3e-171 

argininosuccinate synthetase mRNA 

MPH 

AA7225105 

e-133 

Kappa casein precursor mRNA 

MPH 

S35960 

4e-155 

laminin receptor mRNA 

MPH 

AA4274237.1 

e-65 

ribosomal protein S16 mRNA 

MPH 

R51271 

0.18 



TT 
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MPH 

U57344 

2e-148 

homeobox protein Meis3 mRNA 

MPELiLy 

Z28663 

6e-30 

EST 

MPELiLy 

N58329 

7e-40 

EST/Alu 

MPELiLy 

AA523029 

4e-72 

ribosomal protein L9 mRNA 

MPELiLy 

H56276 

3e-51 

carbamoyl-phosphate synthetase mRNA 

MPELiLy 

W28842 

9e-89 

EST 

MPELiLy 

AC005368 

e-110 

chr.5 

MPELiLy 

M813IO 

0.43 


MPELiLy 

X56932 

0.0 

23KD highly basic protein mRNA 

MPELiLy 

AJ002030 

3e-54 

putative progesterone binding protein mRNA 

MPELiLy 

AA418401 

e-151 

EST 

MPELiLy 

AC005376 

1.7 


MPELiLy 

AA720919 

3e-14 

EST 

MPELiLy 

L29394 

e-164 

haptoglobin alpha 2FS beta mRNA 

MPELiLy 

AC002350 

3e-45 

Alu 

MPELiLy 

AC003100 

2e-15 


MPELiLy 

AA134339 

0.08 


MPELiLy 

AA195932 

0.0 

EST 

MPELiLy 

AA411274 

2e-97 


MPELiLy 

N58329 

9e-42 

EST/Alu 

MPELiLy 

AC002558 

0.0 

EST 

MPELiLy 

AC002300 

0.088 


MPELiLy 

U42386 

e-175 

fibroblast growth factor mRNA 

MPELiLy 

AL031228 

5e-68 

complement gene 

MPELiLy 

U00947 

2e-52 

(CAC)n/(GTG)n containing mRNA 

MPELiLy 

D26488 

4e-17 

KIAA0007 gene 

MPELiLy 

M10098 

4e-73 

18S rRNA 

MPELiLy 

H41283 

6e-07 

EST 

MPELiLy 

AB006198 

0.0 

SART-1 mRNA 

MPELiLy 

AA459599 

e-173 

EST 

MPELiLy 

U92980 

e-113 

DT1P1A10 mRNA 

MPELiLy 

X72889 

-126 

hbrm mRNA 

MPELiLy 

C04745 

e-147 

EST 

MPELiLy 

M55670 

e-177 

protein Z mRNA 

MPELiLy 

X62996 

0.0 

mitochondrial DNA 

MPELiLy 

AF001892 

5e-78 

MEN1 region epsilon/beta mRNA 

MPELiLy 

AD000813 

3e-6 

chr.19 

MPELiLy 

U66306 

e-121 

retinoid X receptor alpha mRNA 

MPELiLy 

H87626 

8e-27 

EST 

MPELiLy 

AL009181 

0.002 

PAC 

MPELiLy 

J02625 

e-165 

cytochrome P-450j 

MPELiLy 

X62153 

e-116 

PI protein mRNA 

MPELiLy 

Z33502 

0.007 


MPELiLy 

AF038186 

8e-95 

clone 23914 mRNA 

MPELiLy 

AB000517 

e-141 

CDP-diacylglycerol synthetase mRNA 

MPELiLy 

V00654 

2e-47 

mitochondrial DNA 

MPELiLy 

AA381346 

3e-5 

EST 

MPELiLy 

U65590 

6e-44 

IL-1 receptor antagonist IL-IRa 

MPELiLy 

U48705 

0.009 


MPELiLy 

Ml 8981 

e-100 

prolactin receptor associated protein mRNA 

MPELiLy 

D55674 

e-105 

heterogeneous nuclear ribonucleoprotein D mRNA 

MPELiLy 

L33264 

8e-35 

CDC2-related protein kinase 

MPELiLy 

AA405094 

e-123 

EST 

MPELiLy 

J01415 

0.0 

mitochondrial DNA 

MPELiLy 

X03559 

0.0 

Fl-ATPase beta subunit mRNA 

MPELiLy 

L06328 

e-166 

voltage-dependent anion channel protein mRNA 

MPELiLy 

ABO 10066 

e-34 

RBP56/hTAFII68 mRNA 

MPELiLy 

AL022098 

e-100 

chr.6 

MPELiLy 

Z69723 

0.16 


MPELiLy 

AF001542 

2e-53 

EST 

MPELiLy 

AF03694 

0.45 


MPELiLy 

AA964847 

1.5 


MPELiLy 

AC000114 

8e-56 

chr. Xq23 

MPELiLy 

U85088 

e-8 

coagulation factor XI gene 

MPELiLy 

AC005952 

5e-8 

chr. 19/Alu 


Abbreviation for the phage display cDNA libraries are as follows. M: MCF-7, P: PC-3, H: 
HeLa transfectant, E: EJG, Li: Liver, and Ly: Lymph node. Frame of the codons and orientation 
of the cDNA inserts varied. Therefore, it should be noted that homology in the nucleotide 
sequence does not necessarily implicate homology in the amino acid sequence. 
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Cell surface complex carbohydrate structures that 
are synthesized through the actions of glycosyltrans- 
ferases play an important role in cell-to-cell and cell- 
to-extracellular matrix interactions. To examine the 
feasibility of phage display technique to clone cDNAs 
encoding glycosyltransferases, we performed biopan¬ 
ning experiments using human histo-blood group A 
transferase as a model enzyme and its substrate, blood 
group H-speciflc glycoproteins, as a bait ligand. Our 
attempts have been unsuccessful, possibly because of 
the enzyme’s weak affinity with the target. However, 
we have selectively enriched several phage clones that 
expressed capsid proteins fused with galectin-3, a 
galactose/lactose-specific animal lectin of the galectin 
family. These results demonstrate that this novel ap¬ 
proach of phage display is useful in cDNA cloning of 
proteins with carbohydrate-binding property. ® 1999 

Academic Press 


In multicellular organisms, cells are organized in 
cooperative assemblies. Each cell is surrounded by 
other cells as well as a complex network of extracellu¬ 
lar matrix. Cell surfaces are polymorphic depending on 
the cell types as well as on the genotypes of individuals. 
In addition to the trafficking of nutrients and waste 
elimination, cell surfaces are also important for the 
communication and adhesion. On the outer membrane, 
complex carbohydrate structures are present as parts 
of glycoproteins and glycolipids. These carbohydrate 
structures are synthesized in Golgi apparatus through 
the actions of glycosyltransferases. Alterations in these 
structures have been reported during cellular differen¬ 
tiation as well as transformation to malignancy (1, 2). 

In the past several years, the roles of carbohydrates 
in inflammation have been intensively studied (3). 
Genes encoding three adhesion proteins that bind 
to neutrophiles/monocytes or endothelial cells were 
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cloned. These proteins all exhibited similar structures 
containing a carbohydrate-recognition domain (CRD), 
therefore, they were collectively named selectins. Se- 
lectins were later shown to bind to oligosaccharides 
with certain structures represented by sialyl Le x (4-6). 
This oligosaccharide-selectin interaction plays an ini¬ 
tial role (rolling effect) in the adhesion of leukocytes to 
the endothelial cells (7, 8). Some carcinomas and leu¬ 
kemias exhibit abundant sialyl Le x expression on their 
cell surfaces and use the same binding mechanism for 
attachment to endothelial cells during metastasis (9). 
A direct correlation between the sialyl Le x expression 
and prognosis of the patients with colon carcinomas 
was reported (10). 

In addition to selectins, cells also use other lectins for 
interaction with carbohydrates. For example, galec- 
tins, members of S-type lectin family (11, 12), possess 
strong affinity with blood group-specific oligosaccha¬ 
rides (13, 14). In vitro synthesized human galectin-3 
was shown to bind to the desialylated glycoprotein 
asialofetuin and to laminin, a major component of 
basement membranes (15). Decreased expression and 
loss of its nuclear localization during neoplastic pro¬ 
gression of colon carcinoma were reported (16). A cor¬ 
relation was noticed between the message levels of this 
lectin and metastatic potentials of K-1735 melanoma 
and UV 2237 fibrosarcoma sublines (17). Expression of 
galectin-1, another member of galectins, on endothelial 
cells has been implicated in apoptosis during T-cell 
development (18). The binding of galectin-1 with 
N-glycans on thymocytes appeared to be crucial in this 
process. A murine S-type lectin, mGBP, works as an 
autocrine negative growth factor (19). 

For the past nine years we continuously studied the 
molecular genetic basis of histo-blood group ABO sys¬ 
tem. ABH(O) antigens are carbohydrate antigens of 
clinical importance in blood transfusion and transplan¬ 
tation medicine. We cloned ABO genes and elucidated 
the molecular genetic basis of ABO system (20, 21). A 
and B alleles encode glycosyltransferases that possess 
different donor nucleotide-sugar specificity (22). A 
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transferase transfers GalNAc (iV-acetyl-D-galactosamine) 
residue to the substrate H by alpha 1—>3 glycosidic 
linkage, whereas B transferase transfers D-galactose. 
We used the A transferase as a model enzyme, and 
examined the feasibility of cDNA cloning of glycosyl- 
transferases by a novel technique employing phage 
display. 

The phage display technology is based on the surface 
expression of the peptide sequences fused with phage 
capsid protein. Until recently, most applications were 
restricted to clone phage particles expressing variable 
domains of antibodies specific to certain antigens (23- 
25). Phage display peptide libraries made with syn¬ 
thetic oligonucleotides have recently been utilized to 
identify peptide sequences that interact with a variety 
of bait ligands, such as proteins, peptides, DNAs, 
RNAs, and oligonucleotides (26, 27). In addition to the 
in vitro selections, their applications have been ex¬ 
tended to in vivo selections (28, 29). Surface display 
was also used for epitope mapping (30). Coat protein 
gene III of filamentous phage has been commonly used 
for phage display, but other fusion vectors such as 
flagellin gene (fliC) of E. coli and capsid protein gene 
10 of T7 phage have become available. While the 
C-terminus of the peptide sequence is fused with gene 
III capsid protein in the filamentous phage system 
(N-terminus of the peptide is linked to pelB leader 
sequence), the N-terminus of peptide sequence fuses 
with gene 10 capsid protein in the T7 system. With this 
T7 system, it is now possible to produce the phages 
that display protein sequences up to 1200 amino acid 
residues long. 

We constructed a phage display cDNA library using 
RNA from cells that stably expressed A transferase, 
and performed biopanning experiments using, as a bait 
ligand, crude mucin fraction containing blood group 
H-specific glycoproteins. So far no enrichment of the 
phages that expressed A transferase fusion protein has 
been obtained. Unexpectedly, however, selective aug¬ 
mentation was observed of the phages that expressed 
the fusion proteins with galectin-3, a soluble /3-galac- 
toside-binding (S-type) lectin. Because of this lectin’s 
known affinity with the blood group-specific oligosac¬ 
charides (13, 14), these results demonstrated, for the 
first time, that the phage display was successful in 
cloning cDNAs encoding a protein with binding capac¬ 
ity to carbohydrates. 

EXPERIMENTAL PROCEDURES 

Construction of a phage display cDNA library. We constructed a 
phage display cDNA library in bacteriophage T7Select System. HeLa 
cell transformant that stably expressed large amounts of A antigens 
was used for an RNA source. Total RNA was prepared with RNA 
STAT-60 reagents (TEL-TEST, Friendswood, TX), and poly A+ RNA 
was selected by oligo-dT affinity chromatography. Directional cDNA 
library was constructed by using Directional RH Random Primer 
cDNA Synthesis Kit and T7Select 1-lb Cloning Kit (Novagen, Mad¬ 


ison WI). After in vitro packaging, a small aliquot was used to titrate 
the library. The remaining library was amplified prior to biopanning. 
Amplified library was also titrated. 

Crude extraction of mucins. The human stomach mucosa of blood 
group O secretor individuals were used to prepare crude mucin 
fraction that contained blood group H-specific glycoproteins (31). 
Mucosa was homogenized, digested with pepsin, and ethanol precip¬ 
itated. After phenol extraction, soluble fraction was precipitated 
with 10% ethanol. 

Biopanning protocol. The MaxiSorp 96 well plate (Nunc, Roches¬ 
ter, NY) was coated for two hours at room temperature with 50 pi of 
PBS solution containing 5 ng, 50 ng, 500 ng, 5 pg, or 50 pg of crude 
mucin. After the blocking with PBS containing 3% BSA overnight at 
room temperature, the wells were washed with PBS. Next, the 
glycoprotein-coated wells were incubated for 2 hours with 5.3 X 10 s 
phages in Tris-buffered saline (TBS) supplemented with 20 mM 
MnCl 2 and 0.05% Tween 20, followed by washing with TBS. Then the 
overnight culture of BLT 5403 strain of E. coli was added, the wells 
were incubated for an hour, the contents were transferred into 
L-Broth supplemented with 50 pg/ml carbenicillin and additional 
overnight culture of BLT 5403, and the phages were grown until lysis 
occurred. Phage lysate was then diluted. The entire procedures were 
repeated two more times. 

PCR amplification and DNA sequencing. To determine the en¬ 
richment effect of biopanning, phage lysates containing selected 
populations of phages from each round of biopanning were used as 
templates for PCR amplification. T7 UP and DOWN primers (Nova¬ 
gen) corresponding to the sequences in the phage vector DNA that 
neighbor the inserts were used to amplify cDNA sequences. PCR was 
performed in 20 pi of a reaction mixture containing IX PCR buffer 
(10 mM Tris-HCl (pH 8.3), 50 mM KC1, 2.5 mM MgCl 2 , 0.001% 
Gelatin), 10% DMSO, 250 ju.M dNTP mix, 40 pmol each of primers, 
and 1 units of Taq DNA polymerase (BRL-Life Technologies, Gaith¬ 
ersburg, MD). Amplified fragments were analyzed through 3% aga¬ 
rose gel electrophoresis, gel purified with GeneClean II Kit (BIO 101, 
Vista, CA), and subjected to DNA sequencing using PRISM Dye 
Terminator Cycle Sequencing Ready Reaction Kit (PE-ABI, Foster 
City, CA) directly and after T-A cloning into the plasmid vector, 
pT-Adv (Clontech, Palo Alto, CA). 

Probe preparation and hybridization. Plasmid DNA containing 
the galectin-3 cDNA sequence was digested with EeoRI. The insert 
DNA was gel-purified after 3% agarose gel electrophoresis. Random 
hexamer labeling method was employed to radiolabel the fragment 
using 32 P-dCTP (NEN-Dupont, Boston, MA) and Random Primed 
DNA Labeling Kit (Boehringer-Mannheim, Indianapolis, IN). PCR- 
amplified DNA from the selected populations of phages analyzed 
through agarose gel electrophoresis as above was denatured, neu¬ 
tralized, Southern-transferred onto a Nylon membrane, and UV 
cross-linked. Approximately 500 phages were plated from each se¬ 
lection, plaque-transferred onto Nylon membranes and processed. 
These membrane filters were hybridized overnight with the 
galectin-3 cDNA probe in Pipes buffer (40 mM Pipes (pH 6.4), 1 mM 
EDTA (pH 8.0). 0.4 M NaCl) with 80% formamide and 1% SDS at 
42°C. The filters were then washed and exposed to the X-ray film. 

RESULTS 

We constructed a HeLa cell transformant phage dis¬ 
play cDNA library in T7Select 1-lb vector. Most, if not 
all, of the glycosyltransferases, including A trans¬ 
ferase, possess their catalytic domains at the 
C-terminal side (300-550 amino acid residues depend¬ 
ing on the enzymes) of the proteins. Although the 
transmembrane domain is important for the Golgi lo¬ 
calization, several experimental data suggested that 
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(A) 


M 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 M 



(B) 


FIG. 1. PCR amplification of cDNA inserts from phage popula¬ 
tions selected by biopanning. (A). Phage lysates were heat-denatured 
before being used as templates for PCR. Lysates used are from; 
original phage display HeLa cell transformant cDNA library (lane 1), 
phage populations obtained after one (lane 2), two (lane 3), and three 
(lane 4) rounds of selections with 5 ng, or after one (lane 5), two (lane 
6), and three (lane 7) rounds of selections with 50 ng, or after one 
(lane 8), two (lane 9), and three (lane 10) rounds of selections with 
500 ng, or after one (lane 11), two (lane 12), and three (lane 13) 
rounds of selections with 5 /xg, or after one (lane 14), two (lane 15), 
and three (lane 16) rounds of selection s with 50 pg of blood group- 
specific glycoproteins. Lane 17 shows the result of PCR without 
lysate as a negative control. M denotes 100-bp DNA ladder marker. 
(B). DNA was transferred from the above gel onto a Nylon membrane 
filter, and the filter was hybridized with a 32 P-radiolabeled galectin-3 
specific probe. A total of five bands were hybridized with the probe. 


ing a radiolabeled A transferase cDNA probe, however, 
showed no signs of enrichment (data not shown). Then 
we examined whether certain populations of phages 
might be enriched during these biopanning procedures. 
Using the heat-denatured phage lysate as a template 
for PCR, we amplified cDNA inserts from the selected 
populations of phages, and observed enrichment of cer¬ 
tain phages as judged by the increased band intensity 
through the rounds of selections. 

Enrichment was observed irrespective of the target 
concentrations (Fig. 1A). A single major band was am¬ 
plified from the phage population obtained after three 
rounds of selections with 50 /xg of the target ligand. 
When 5 pg and 500 ng of targets were used, three and 
two major bands were amplified, respectively. Enrich¬ 
ment was also seen with the phage populations se¬ 
lected with lesser concentrations of the targets. Nucle¬ 
otide sequences of these PCR-amplified bands were 
determined by direct sequencing as well as after clon¬ 
ing into a plasmid vector. Surprisingly, several of these 
sequences were derived from the cDNA sequence en¬ 
coding galectin-3, a member of carbohydrate-binding 
proteins known as lectin. Nucleotide and the deduced 
amino acid sequences of those bands from galectin-3 
cDNA are schematically shown around the fusion junc¬ 
tion in Fig. 2. Although the locations and sizes of the 
inserts varied, their reading frames were identical to 
that of the fused partner T7 capsid protein. Their 
C-terminal ends were conserved. 

Galectin-3, also known as IgE-binding protein (epsilon- 
BP), carbohydrate-binding protein 35 (CBP 35), or as 
macrophage cell-surface protein Mac-2, was originally 
identified in rat basophilic leukemia cells by virtue of its 


this signal/anchor domain and the N-terminal cyto¬ 
plasmic domain were not required for the enzymatic 
activity (32). T7Select 1-lb display vector can display 
on the surface of T7 capsid proteins up to 1,200 amino 
acid residues in size. Target sequences are fused to the 
C- terminus of the 10B capsid protein. Therefore, this 
vector was chosen to assess the feasibility of phage 
display approach to clone glycosyltransferase cDNAs. 

For an RNA source, we used HeLa cell transformant 
that expressed A transferase. The library contained 
5.3 X 10 6 independent phage clones, and the average 
insert size was 760 bp. Because only 0.1-1 copies of 
fusion proteins are displayed on T7Select 1-lb phages, 
one hundred times the size of library (5.3 X 10 8 phages) 
was used for the initial round of biopanning selections 
with blood group glycoproteins, the substrate for A 
transferase enzymatic reaction, as a target. 

We anticipated the enrichment of phages that dis¬ 
played the A transferase structure through the rounds 
of selections. Results from the plaque hybridization 
experiments of the selected populations of phages us- 


Phages 


298 

£ GCC TAC CCT 
100 

Ala Tyr Pro 


GCC ACT GGC CCC TAT ... 


Ala Thr Gly Pro Tyr ... 


AAT TCA AGC GCC AGC AGC 
1 

GAG CGG AAA ATG GCA GAC 
1 

Met Ala Asp 


CGT CCG GAG CCA GCC AAC 


AAT TTT TCG CTC CAT ... 


Asn Phe Ser Leu His ... 


232 

AAT TCA AGC GTC TAC CCA 
78 

Val Tyr Pro 
346 

AAT TCA AGC GTG CCT TAT 
116 

Val Pro Tyr 


GGG CCA CCC AGC GGC ... 


Gly Pro Pro Ser Gly ... 


AAC CTG CCT TTG CCT ... 


Asn Leu Pro Leu Pro ... 


FIG. 2. Nucleotide and the deduced amino acid sequences of phage 
clones containing galectin-3 cDNA sequence. Nucleotide and the de¬ 
duced amino acid sequences from the four phage clones that expressed 
galectin-3 fusion proteins are schematically shown around the Eco RI 
fusion junctions with gene 10 B in the T7Select 1-lb Vector. The A 
residue of initiation codon and the initiation codon are numbered 1 in 
the nucleotide and the deduced amino acid sequences, respectively. The 
sequences derived from the Eco RI linker are underlined. 
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5 ng 50 ng 500 ng 5 jig 50 jig 



FIG, 3. Hybridization of phage plaques with a galectin-3 probe. Phages from selections were plated at approximately 500 plaques per 
plate. After plaque-transfer onto Nylon membrane, the membranes were hybridized with the galectin-3 probe. 


affinity with IgE (33). Human galectin-3 possesses an 
open reading frame of 750 base pairs encoding a 250- 
amino-acid protein. The N-terminal 41 amino acids are 
homologous to a serum response transcription factor, and 
the adjacent sequence (aa 42-106) contains the repetitive 
proline-glycine-rich motif. The C-terminal domain con¬ 
tains CRD, a sequence motif shared by other S-type lec¬ 
tins. This domain is responsible for the binding, and is 
well conserved among different species of mammals (34). 
Although all S-type lectins bind /3-galactoside, the speci¬ 
ficity of each lectin seems to be unique (13, 35). Our 
results confirmed the ability of galectin-3 to bind the 
blood group structures (13, 14). 

We examined the selective enrichment of phages 
expressing galectin-3 fusion proteins during biopan¬ 
ning by two methods; the Southern hybridization and 
the plaque hybridization. DNA was transferred from 
the agarose gel, shown in Fig. 1A, onto a Nylon mem¬ 
brane, and the membrane was hybridized with a radio- 
labeled galectin-3 cDNA probe. The result is shown in 
Fig. IB. Five bands (one, three, and one from selections 
with 50 p,g, 5 /ng, and 500 ng of target ligand, respec¬ 


tively) were hybridized with the probe. Lysates, con¬ 
taining selected populations of phages, were plated at a 
density of approximately 500 phages per plate. DNA 
was transferred onto Nylon membranes, and these 
membranes were used for the plaque hybridization 
with the galectin-3 probe. Results from the plaque hy¬ 
bridization experiments are shown in Fig. 3. The num¬ 
bers of the hybridized plaques were counted, enrich¬ 
ment factors were calculated, and the results are 
shown in Table 1. These two hybridization experiments 
clearly demonstrated the selective amplification of 
phages that expressed galectin-3 structures. The re¬ 
sults also showed that the selection was target concen¬ 
tration dependent. Although the enrichment was ob¬ 
served even when the bait ligand was used in lower 
concentrations, the sequences from the enriched 
phages were not related to the galectin-3 sequence. 

DISCUSSION 

We obtained the results that demonstrate the feasi¬ 
bility of phage display technology for the purposes of 
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TABLE I 


Enrichment of Phages That Express 
Galectin-3 Fusion Proteins 


Target ligand 
amount 

Round of 
selections 

No. of 
total 
plaques 

No. of 
galectin-3 
positive 
plaques 

Enrichment 

Original library 

None 

8122 

1 

1 

5 ng 

1st 

412 

0 

— 


2nd 

642 

0 

— 


3rd 

602 

0 

— 

50 ng 

1st 

502 

0 

— 


2nd 

519 

0 

— 


3rd 

493 

0 

— 

500 ng 

1st 

301 

0 

— 


2nd 

391 

2 

41.6 


3rd 

625 

247 

3210.4 

5 Mg 

1st 

610 

20 

266.3 


2nd 

650 

267 

3336.9 


3rd 

610 

525 

6991.5 

50 Mg 

1st 

374 

16 

347.5 


2nd 

312 

83 

2161.1 


3rd 

536 

359 

5440.9 


Note. Enrichment of phages expressing galectin-3 fusion protein 
was examined by plaque hybridization of phages from each round of 
selection. 


detecting carbohydrate-protein interaction and cloning 
phage particles displaying fusion proteins with carbo¬ 
hydrate affinity. The experiments were originally in¬ 
tended to evaluate the capability of cloning cDNAs 
encoding glycosyltransferases using this novel technol¬ 
ogy. We examined whether the phages that expressed 
A transferase fusion protein could be enriched by serial 
rounds of biopannings to the target containing blood 
group H-specific oligosaccharide structures, the sub¬ 
strates for this enzyme. Rather than A transferase, we 
observed enrichment of phages that expressed 
galectin-3 fusion proteins after the selections with rel¬ 
atively higher concentrations of target. This enrich¬ 
ment was specific, without doubt, because multiple 
number of phage clones containing different portions of 
the galectin-3 were obtained. Enrichment was also ob¬ 
served with lower concentrations of target, however, it 
remains to be resolved whether or not the enrichment 
was meaningful because multiple numbers of clones 
containing different portions of the identical sequences 
were not obtained in these cases. 

Enrichment of phages that express galectin-3 was 
unexpected. However, considering the reported strong 
affinity of certain S-type lectins to the blood group 
structures, this may not be astonishing. Nonetheless, 
the phage display cDNA cloning was shown to be suc¬ 
cessful in targeting at the proteins with carbohydrate 
affinity. The same approach may be useful to clone 
cDNAs that encode other carbohydrate-binding pro¬ 
teins, using the appropriate target ligands for selec¬ 


tions. If the binding/washing conditions are optimized, 
cDNA cloning of proteins with lower affinity may be 
possible. Various modifications of conditions may be 
workable. The temperature for phage binding can be 
changed. Concentration of calcium and other cations 
may be critical for those proteins that depend on these 
ions for binding. Phage display system allows both 
evaluation of the presence of other types of lectins and 
the determination of their identity. Different from gly¬ 
cosyltransferases, however, some lectins, such as selec- 
tins, possess CRD at the N-terminus. Therefore, use of 
two complementary vector systems that permit the 
fusion at both the N- and the C-terminus may become 
necessary. 

Selection of glycosyltransferase cDNAs by phage dis¬ 
play approach seems to be difficult, even if not impos¬ 
sible, especially when competitive phages that exhibit 
fusion proteins, such as lectins, with stronger binding 
properties to the same target are present in the library. 
The messages for lectins are generally more abundant 
than those for glycosyltransferases. Therefore, the 
source of RNA for the library construction may be 
crucial. Human A transferase was previously purified 
from lungs to the homogeneity through several steps of 
procedures including Triton X-100 extraction, Sepha- 
rose 4B (UDP) chromatography, cation-exchange chro¬ 
matography, and reverse phase chromatography (36). 
Because we were unsure whether the phages would 
tolerate these procedures and because we had crude 
mucin fraction with blood group H activity, we took a 
shortcut approach by utilizing the affinity of the en¬ 
zyme to the substrate. Chromatographic steps similar 
to some of those used in enzyme purification may be 
applied to isolate the phages displaying A transferase. 
Use of high copy number display T7Select 415b Vector 
may increase the valence and enhance the phage’s 
affinity with the ligand. This vector, however, can only 
incorporate a maximum of 39 amino acids, which may 
be insufficient to code for CRD when taking into ac¬ 
count the general belief that about 100 amino acid 
residues are necessary to form a self-contained sugar¬ 
binding site (12). Future success of cloning glycosyl¬ 
transferase cDNAs by phage display, therefore, may 
heavily depend on the development of new vectors that 
display multiple copies of long protein sequences. 

Lectins are involved in pathological as well as phys¬ 
iological phenomena, and may also play an important 
role in progression and metastasis of cancer. We have 
shown that this novel approach of phage display is 
useful in identifying proteins with carbohydrate¬ 
binding properties. Using this technique, new types of 
lectins, if any, may be identified, CRD epitopes may be 
mapped, and our understanding of the structures and 
functions of these molecules involved in carbohydrate- 
protein interactions may be greatly advanced. 
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